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55. (Amended) A te^it for determining if BS322 antigen or anti-BS322 
antibody is present in a test sample, sahd^it comprising: 

a container containing at leastsone BS322 polypeptide having at least 95% 
identity over the entire length of a sequence selected from the group consisting of 
SEQUENCE ID NO: 25, SEQUENCE ID NO: 26, S^UENCE ID NO: 27, and 
SEQUENCE ID NO: 28. . 

REMARKS 

The Examiner has maintained the rejection under 35 U.S.C. § 101 and 35 U.S.C. 
§ 112, first paragraph. Applicant respectfully disagrees. 

BS322 is a tissue specific putatine transcription factor in breast tissue found by 
SEREX (Serological analysis of recombinant tumor cDNA expression libraries). 
Applicant submits Exhibit A, ["Homo sapiens similar to breast cancer antigen NY-BR-1 
(LOC91074), mRNA"] and Exhibit B, [Jager, D.^et a/., "Identification of a Tissue- 
specific Putative Transcription Factor in Breast Tissue by Serological Screening of a 
Breast Cancer Library", Cancer Research, 61:2055-2061 (2001)], to illustrate this point. 
Exhibit A shows the homology between BS322 and the molecule NY-BR-1 . As 
evidenced from this Exhibit, BS322 and NY-BR-1 are the same molecule. Exhibit B 
shows that NY-BR-1 (BS322) is found in breast cancer tissues. Specifically, NY-BR-1 
(BS322) is found in 21 of 25 breast cancers but only in 2 of 82 non-manmiary tumors. 
Thus, BS322 clearly has utility as a diagnostic tool for the detection of breast cancer. 

The Examiner further rejects claims 52-61 under 35 U.S.C. § 1 12, first paragraph, 
due to the "90% identity" language in the claims. Applicant has raised the percent 
identity to 95%. 

Applicant further submits the software manual to the Wisconsin Sequence 
Analysis program, Version 8, publicly available from Genetics Computer Group, 
Madison, WI, as Exhibit C. Support for this submission is found on page 1 6, beginning 
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line 7. The manual provides the algorithm, parameters, parameter values and other 
information necessary to, accurately and consistently, calculate the percent identity. This 
manual indicates on pages 5-21, inter olia, that the software used the local homology 
algorithm of Smith and Waterman (Advances in Applied Mathematics 2; 482-489 
(1981)). 

The Examiner further rejects claims 62-69, 72-76 and 80 under 35 U.S.C. § 1 12 
paragraph 2 due to the "epitope" language percent in the claims. 

The methods for identifying epitopes in a novel peptide sequence are well known 
and described in both the scientific, commercial, and patent literature. For example, M. 
H. Van Regenmortel describes how to predict epitopes from the primary sequence of a 
protein. (See "Protein structure and antigenicity", IntJRadAppl Instrum B., \A{^ym- 
80, 1987.) 

Further, Perkin-Elmer Biosystems, a major provider of DNA sequencing and 
peptide synthesizing instruments has established a public website which describes how to 
select peptides which reflect the epitopes of a protein. (See 

http://www.pebio.com/pa/3409 1 3/html/chapt2.html#Choosing the Epitope.) This 
electronic publication was posted in 1996 and basically describes the process employed 
by the inventors of the current patent application. 

In addition, patent application PCT/US97/00485 describes in detail how to 
identify epitopes from peptide sequences. The sequence can be scanned for 
hydrophobicity and hydrophilicity values by the method of Hopp, Prog. Clin. Biol. Res. 
172B: 367-377 (1985) or the method of Cease et al, J. Exp. Med, 164: 1779-1784 (1986) 
or the method of Spouge et al, J. Immunol. 138: 204-212 (1987). Commercial software 
programs to implement these methods are available. 
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CONCLUSION 



In view of the aforementioned remarks. Applicant respectfully submits that the 
above-referenced application is now in a condition for allowance and Applicant 
respectfully requests that the Examiner withdraw all outstanding objections and rejections 
and passes the application to allowance. 
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Version with Markings to Show Changes Made 

52. (Amended) A [BS322] purified polypeptide, having at least [90%] 95% 
identity over the entire length of a sequence selected from the group consisting of [SEQ 
ID NOS: 25-28] SEQUENCE ID NO: 25, SEQUENCE ID NO: 26, SEQUENCE ID NO: 
27, and SEQUENCE ID NO: 28. 



55. (Amended) A test kit for determining if BS322 antigen or anti-BS322 
antibody is present in a test sample, said kit comprising: 

a container containing at least one BS322 polypeptide having at least 
[90%] 95% identity over the entire length of a sequence selected from the group 
consisting of [SEQ ID NOS: 24-28] SEQUENCE ID NO: 25, SEQUENCE ID NO: 26, 
SEQUENCE ID NO: 27, and SEQUENCE ID NO: 2& 



EXHIBIT A 

RECEIVED 

WiAR 1 3 ZOO^ 

TECH CENTER 1600/2900 

Alignments 

> c;ri 1 16156644 I ref |XM 035844.31 Homo sapiens similar to breast cancer antigen 
NY-BR-1 (LOC91074), 
mRNA 
Length '= 4408 

Score = 2266 bits (1143), Expect =0.0 
Identities = 1143/1143 (100%) 
Strand = Plus / Plus 

Query: 1197 aggtttctcacactcatgaaaatgaaaattatctcttacatgaaaattgcatgttgaaaa 1256 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I N I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
Sbjct : 3200 aggtttctcacactcatgaaaatgaaaattatctcttacatgaaaattgcatgttgaaaa 3259 



Query: 1257 aggaaattgccatgctaaaactggaaatagccacactgaaacaccaataccaggaaaagg 1316 

I I I I I I I I I I I I I I i I 1 I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I I I I I 
Sbjct: 3260 aggaaattgccatgctaaaactggaaatagccacactgaaacaccaataccaggaaaagg 3319 



Query: 1317 aaaataaatactttgaggacattaagattttaaaagaaaagaatgctgaacttcagatga 1376 

I M I I I I I I I I I I I I I I I I I I I I M I I I 1 I I I I I I I i I I I I I I I I I I i i I I I I I I I I I I I 
Sbjct: 3320 aaaataaatactttgaggacattaagattttaaaagaaaagaatgctgaacttcagatga 3379 



Query: 1377 ccctaaaactgaaagaggaatcattaactaaaagggcatctcaatatagtgggcagctta 1436 

I I I I I i I I I I I I I I I I I i I I I i I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 3380 ccctaaaactgaaagaggaatcattaactaaaagggcatctcaatatagtgggcagctta 3439 



Query: 1437 aagttctgatagctgagaacacaatgctcacttctaaattgaaggaaaaacaagacaaag 14 96 

I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I M I I I I I I I I I I i i I I I I i i I I I I I I I I I I 
Sbjct: 3440 aagttctgatagctgagaacacaatgctcacttctaaattgaaggaaaaacaagacaaag 3499 



Query: 14 97 aaatactagaggcagaaattgaatcacaccatcctagactggcttctgctgtacaagacc 1556 

IMIIIIMIIillllllllllllllllllllllllllillllllllllllilillllll 
Sbjct: 3500 aaatactagaggcagaaattgaatcacaccatcctagactggcttctgctgtacaagacc 3559 



Query: 1557 atgatcaaattgtgacatcaagaaaaagtcaagaacctgctttccacattgcaggagatg 1616 

I i I I I I M I I I I I I I I I I I I I I i I M I I I I I I I I M I I I I I I I I 1 I I M I I I I I I I I I I I 
Sbjct: 3560 atgatcaaattgtgacatcaagaaaaagtcaagaacctgctttccacattgcaggagatg 3619 



Query: 1617 cttgtttgcaaagaaaaatgaatgttgatgtgagtagtacgatatataacaatgaggtgc 1676 

I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 3620 cttgtttgcaaagaaaaatgaatgttgatgtgagtagtacgatatataacaatgaggtgc 3679 



Query: 1677 tccatcaaccactttctgaagctcaaaggaaatccaaaagcctaaaaattaatctcaatt 1736 

I I I M I I I I I I I I I I I I I M I M I I I I I I I I M I M I I I I I I I I i I I I I I I M I I I I I I I 
Sbjct: 3680 tccatcaaccactttctgaagctcaaaggaaatccaaaagcctaaaaattaatctcaatt 3739 



Query: 1737 atgcaggagatgctctaagagaaaatacattggtttcagaacatgcacaaagagaccaac 1796 

IIIIIIIIIIIIIIIMIIIIIIIIIIIillMllllllilllllMIIIIIIIIIIIII 
Sbjct: 3740 atgcaggagatgctctaagagaaaatacattggtttcagaacatgcacaaagagaccaac 3799 



Query: 1797 gtgaaacacagtgtcaaatgaaggaagctgaacacatgtatcaaaacgaacaagataatg 1856 

I I I M I I I I I I I I i i I I I I I I I I I I i I I I I I I I I I I I I I M I i M I I I I I i i I I I I I i I I 
Sbjct: 3800 gtgaaacacagtgtcaaatgaaggaagctgaacacatgtatcaaaacgaacaagataatg 3859 



Query: 1857 tgaacaaacacactgaacagcaggagtctctagatcagaaattatttcaactacaaagca 1916 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIiillllllllMIIIIM 
Sbjct: 3860 tgaacaaacacactgaacagcaggagtctctagatcagaaattatttcaactacaaagca 3919 



Query: 1917 aaaatatgtggcttcaacagcaattagttcatgcacataagaaagctgacaacaaaagca 1976 

I I I I M I I I I I I I I I I I I I I I I M I I I I I I i I I I i I M I I I I I I i I I I I I I I M I I I I I I 
Sbjct: 3920 aaaatatgtggcttcaacagcaattagttcatgcacataagaaagctgacaacaaaagca 3979 



Query: 1977 agataacaattgatattcattttcttgagaggaaaatgcaacatcatctcctaaaagaga 2036 

lllllllllllllllllllllilllllllllllllllllllMIIIIIIIIIIIIIIIII 
Sbjct: 3980 agataacaattgatattcattttcttgagaggaaaatgcaacatcatctcctaaaagaga 4 039 



Query: 2037 aaaatgaggagatatttaattacaataaccatttaaaaaaccgtatatatcaatatgaaa 2096 

MIIIMIMIIIIMIIIIIIIIIIIIIIIIIIIIMIIMIIIIIMMIilllllM 
Sbjct: 4040 aaaatgaggagatatttaattacaataaccatttaaaaaaccgtatatatcaatatgaaa 4099 



Query: 2097 aagagaaagcagaaacagaaaactcatgagagacaagcagtaagaaacttcttttggaga 2156 

ililllllllllllllllllllllllllllllllllllllllllllllilMIIIIIIII 
Sbjct: 4100 aagagaaagcagaaacagaaaactcatgagagacaagcagtaagaaacttcttttggaga 4159 



Query: 2157 aacaacagaccagatctttactcacaactcatgctaggaggccagtcctagcatcacctt 2216 

II i I I I I I I I I I I 11 I I I I I i I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i 
Sbjct: 4160 aacaacagaccagatctttactcacaactcatgctaggaggccagtcctagcatcacctt 4219 



Query: 2217 atgttgaaaatcttaccaatagtctgtgtcaacagaatacttattttagaagaaaaattc 2276 

I I I I I I i I I i I i I I I I I I I I I i I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Sbjct: 4220 atgttgaaaatcttaccaatagtctgtgtcaacagaatacttattttagaagaaaaattc 4279 



Query: 2277 atgatttcttcctgaagcctacagacataaaataacagtgtgaagaattacttgttcacg 2336 

I M I I I I I I I I I I i I I M I I I I I I I I I I i I I I M I i I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 4280 atgatttcttcctgaagcctacagacataaaataacagtgtgaagaattacttgttcacg 4339 



Query: 2337 aat 2339 
I I I 

Sbjct: 4340 aat 4342 



'> qi 1 134 69728 I qb | AF269087 , 1 1 AF269087 Homo sapiens breast cancer antigen NY-BR-1 
mRNA, coinplete cds 

Length = 4458 

Score = 2266 bits (1143), Expect =0.0 
Identities = 1143/1143 (100%) 
Strand = Plus / Plus 

Query: 1197 aggtttctcacactcatgaaaatgaaaattatctcttacatgaaaattgcatgttgaaaa 1256 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I j I I I M M I I I I I I I I I I I I I 1 1 I I I I I I 
Sbjct: 3197 aggtttctcacactcatgaaaatgaaaattatctcttacatgaaaattgcatgttgaaaa 3256 

Query: 1257 aggaaattgccatgctaaaactggaaatagccacactgaaacaccaataccaggaaaagg 1316 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 n I I M 

Sbjct : 3257 aggaaattgccatgctaaaactggaaatagccacactgaaacaccaataccaggaaaagg 3316 

Query: 1317 aaaataaatactttgaggacattaagattttaaaagaaaagaatgctgaacttcagatga 1376 

I I I i I I I I I I I I I I I I I i I I I I I I I I I I I 1 I I M I I I I I I I I I I 1 I I I I I I 1 I I I I I I I I 
Sbjct: 3317 aaaataaatactttgaggacattaagattttaaaagaaaagaatgctgaacttcagatga 3376 

Query: 1377 ccctaaaactgaaagaggaatcattaactaaaagggcatctcaatatagtgggcagctta 1436 

I I I M I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Sbjct: 3377 ccctaaaactgaaagaggaatcattaactaaaagggcatctcaatatagtgggcagctta 3436 

Query: 1437 aagttctgatagctgagaacacaatgctcacttctaaattgaaggaaaaacaagacaaag 14 96 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 3437 aagttctgatagctgagaacacaatgctcacttctaaattgaaggaaaaacaagacaaag 3496 

Query: 14 97 aaatactagaggcagaaattgaatcacaccatcctagactggcttctgctgtacaagacc 1556 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Sbjct: 3497 aaatactagaggcagaaattgaatcacaccatcctagactggcttctgctgtacaagacc 3556 

Query: 1557 atgatcaaattgtgacatcaagaaaaagtcaagaacctgctttccacattgcaggagatg 1616 

I I I I I I I I I I I I I I M I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I i I i I M I I I I I 
Sbjct: 3557 atgatcaaattgtgacatcaagaaaaagtcaagaacctgctttccacattgcaggagatg 3616 



Query: 



1617 



cttgtttgcaaagaaaaatgaatgttgatgtgagtagtacgatatataacaatgaggtgc 167 6 
I M i I I I I I I I M I I I I I I I I I I I I I I I M M I I I I I I I M I I I I I I I I I I i I I I I I I M 



Sbjct : 3617 cttgtttgcaaagaaaaatgaatgttgatgtgagtagtacgatatataacaatgaggtgc 3676 

Query: 1677 tccatcaaccactttctgaaqctcaaaqgaaatccaaaagcctaaaaattaatctcaatt 1736 

111111111111111111111111111111111111111111111111111111111111 
Sbjct : 3677 tccatcaaccactttctgaagctcaaaggaaatccaaaagcctaaaaattaatctcaatt 3736 

Query: 1737 atgcaggagatgctctaagagaaaatacattggtttcagaacatgcacaaagagaccaac 1796 

I I I I I I I I I I I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 3737 atgcaggagatgctctaagagaaaatacattggtttcagaacatgcacaaagagaccaac 3796 

Query: 1797 gtgaaacacagtgtcaaatgaaggaagctgaacacatgtatcaaaacgaacaagataatg 1856 

I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct : 3797 gtgaaacacagtgtcaaatgaaggaagctgaacacatgtatcaaaacgaacaagataatg 3856 

Query: 1857 tgaacaaacacactgaacagcaggagtctctagatcagaaattatttcaactacaaagca 1916 

i I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Sbjct: 3857 tgaacaaacacactgaacagcaggagtctctagatcagaaattatttcaactacaaagca 3916 

Query: 1917 aaaatatgtggcttcaacagcaattagttcatgcacataagaaagctgacaacaaaagca 1976 

. I I I I I I I I M I I I I I I I i I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 3917 aaaatatgtggcttcaacagcaattagttcatgcacataagaaagctgacaacaaaagca 3976 

Query: 1977 agataacaattgatattcattttcttgagaggaaaatgcaacatcatctcctaaaagaga 2036 

I I i I I I I I i I I I I I I I I I I I I I I I I M I I I I I I I I M M i I I I I I I I I I I I M I I i I I I I 
Sbjct: 3977 agataacaattgatattcattttcttgagaggaaaatgcaacatcatctcctaaaagaga 4036 

Query: 2037 aaaatgaggagatatttaattacaataaccatttaaaaaaccgtatatatcaatatgaaa 2096 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 4037 aaaatgaggagatatttaattacaataaccatttaaaaaaccgtatatatcaatatgaaa 4096 

Query: 2097 aagagaaagcagaaacagaaaactcatgagagacaagcagtaagaaacttcttttggaga 2156 

I I I I I I I I I I I i I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 4097 aagagaaagcagaaacagaaaactcatgagagacaagcagtaagaaacttcttttggaga 4156 

Query: 2157 aacaacagaccagatctttactcacaactcatgctaggaggccagtcctagcatcacctt 2216 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct: 4157 aacaacagaccagatctttactcacaactcatgctaggaggccagtcctagcatcacctt 4216 

Query: 2217 atgttgaaaatcttaccaatagtctgtgtcaacagaatacttattttagaagaaaaattc 2276 

I I I I I I I I I I I I I I I I i I I I I I I I I I M I I I I I I I I I I I I I i I I I I I I I I I I I I i I I I I I 
Sbjct: 4217 atgttgaaaatcttaccaatagtctgtgtcaacagaatacttattttagaagaaaaattc 4276 

Query: 2277 atgatttcttcctgaagcctacagacataaaataacagtgtgaagaattacttgttcacg 2336 

M I I I I I i I I I I I I i I I I I I I I 1 I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Sbjct: 4277 atgattt cttcctgaagcctacagacataaaataacagtgtgaagaattacttgttcacg 4336 
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ABSTRACT 

Application of SEREX (serological analysis of recombinant tumor 
cDNA expression libraries) to different tumor types has led to tbe iden- 
tification of several categories of human tumor antigens. In this study, the 
analysis of a breast cancer library with autologous patient serum led to the 
isolation of seven genes, designated NY-BR-1 through NY-BR-J. NY-BR-I, 
representing 6 of 14 clones isolated, showed tissue-restricted mRNA ex- 
pression in breast and testis but not in 13 other normal tissues tested. 
Among tumor specimens, NY-BR-1 mRNA expression was found in 21 of 
25 breast cancers but in only 2 of 82 nonmammary tumors. Structural 
analysis of NY-BR-l cDNA and the corresponding genomic sequences in 
tbe recentiy released working draft of human genome indicated that 
NY-BR-1 is composed of 37 exons and has an open reading frame of 
4.0-4.2 kb, encoding a peptide of A/^ 150,000-160,000. A bipartite nuclear 
localization signal motif indicates a nuclear site for NY-BR-1, and the 
presence of a bZIP site (DNA-binding site followed by leucine zipper 
motif) suggests that NY-BR-1 Ss a transcription factor. Additional struc- 
tural features include five tandem ankyrin repeats, implying a role for 
NY-BR-1 in protein-protein interactions. NY-BR-1 thus represents a 
breast tissue-specific putative transcription factor with autoimmunogenic- 
ity in breast cancer patients. In addition to NY-BR-1, a homologous gene, 
NY-BR-IJ, was identified in this study. NY-BR-l.J shares 54% amino acid 
homology with NY-BR-1 and also shows tissue-restricted mRNA expres- 
sion. However, unlike NY-BR-l, NY-BR~L1 mRNA Is expressed in brain, 
In addition to breast and testis. The exon structure of NY-BR-J. J remains 
to be defined. Using human genome database, NY-BR-1 was localized to 
chromosome 10pll-pl2, and NY~BR-L1 was tentatively localized to chro- 
mosome 9. 

INTRODUCTION 

Whether immunological factors play a role in the development, 
growth, and progression of human breast cancer remains a critical 
unresolved issue. The lymphocyte infiltrates fi-equently associated 
with breast cancer (1-5), particularly the intense T- and B-cell infil- 
trates in medullary carcinoma (6-8), and the reactive changes in the 
draining lymph nodes of breast cancer patients (9, 10) are consistent 
with the idea of immune recognition in breast cancer. However, 
efforts to relate the lymphocyte infiltrate and lymph node changes 
with prognosis have not yielded conclusive evidence for such an 
association (11). The search for breast cancer antigens that elicit 
humoral or cellular immune reactions in breast cancer patients also 
has a long history, fi*om evidence for immune responses against tiie 
murine mammary tumor virus (12) and delayed hypersensitivity and 
humoral immunity against T/Tn antigens (13), to more recent findings 
of antibody and T-cell responses to p53 (14) and HER-2/neu (15, 16). 

One major challenge confi-onting the analysis of autologous im- 
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mune responses in breast cancer, however, is the well-recognized 
difficulty of establishing breast cancer cell lines as targets for immu- 
nological analysis. This is in contrast to the relative ease of establish- 
ing lines from melanoma, renal cancer, and other tumor types. For this 
reason, the analysis of the human T-cell response against melanoma 
and the molecular identification of the antigens eliciting these re- 
sponses are far more advanced in melanoma (17-19) than in breast 
cancer. 

The recent development of SEREX,** a general method to analyze 
the humoral immtme response of cancer patients that does not require 
autologous tumor cell lines, provides a powerful new way to dissect 
the immune response to breast cancer. Our initial application of 
SEREX to breast cancer led to the identification of pSBFNGl, encoded 
by a putative tumor suppressor gene" in breast cancer, as an immuno- 
genic breast cancer antigen. In addition, CT antigens, shown previ- 
ously to be immtmogenic antigens in other tumor types, were identi- 
fied (20). In the present study, we have continued our effort to define 
breast cancer antigens by SEREX. Of the panel of antigens identified, 
a highly restricted breast autoinunimogenic differentiation antigen, 
NY-BR-1, was identified and characterized. 

MATERIALS AND METHODS 

Tumor Tissue and Cell Lines. The BR 17 tumor sample was derived from 
a s.c. metastasis of a 60-year-old female patient at Krankenhaus Nordwest. The 
patient had an unusually favorable history with metastatic ductal carcinoma of 
the breast. Breast cancer cell lines and cell lines of other tumor types were 
obtained from the repository maintained at the Ludwig Institute for Cancer 
Research, New York Branch at the Memorial Sloan-Kettering Cancer Center. 
Tumor tissues were obtained from the Departments of Pathology at The New 
Yoiic Presbyterian Hospital and the Memorial Sloan-Kettering Cancer Center. 

RNA Extraction and Construction of cDNA Expression library. Total . 
RNA was extracted firom the BR 17 breast cancer sample by conventional 
CsCl-guanidine thiocyanate gradient method. A cDNA library was constructed 
in a A-ZAP Express vector, using a commercial cDNA library kit (Stratagene), 

Immunoscreening of the cDNA Library. The unamplified cDNA expres- 
sion libraiy was screened with the autologous serum at 1:200 dilution. The 
screening procedure was as described previously (21). Briefly, the serum was 
diluted 1:10, preabsorbed with phage-transfected Escherichia coli lysate, fur- 
ther diluted to 1:200, and incubated overnight at room temperature with the 
nitrocellulose membranes (Schleicher & Schuell) containing the phage plaques 
at a density of 4000-5000 pfu/130-mm plate. After washing, the filters were 
incubated with alkaline phosphatase-conjugated goat antihuman Fey second- 
ary antibodies, and the reactive phage plaques were visualized by incubating 
with 5-bromo-4-chloro-3-indolyl-phosphate and nitroblue tetrazolium. 

Sequence Analysis of the Reactive Clones. The reactive clones were 
subcloned, piuified, and in vivo excised to pBK-CMV plasmid forms (Strat- 
agene). Plasmid DNA was prepared by using the Wizard Miniprep DNA 
Purification System (Promega). The inserted DNA was evaluated by £coRI- 
Xba\ restriction mapping, and clones representing different cDNA inserts were 
sequenced. The sequencing reactions were performed by the DNA Sequencing 



^Thc abbreviations used are: SEREX, serological analysis of recombinant tumor 
cDNA expression libraries; CT, cancer testis; EST, expressed sequence tag; RT-PCR, 
reverse transcription-PCR; RACE, rapid amplification of cDNA ends; ORF, open reading 
frame; CD, cluster of differentiation; pfu, plaque^fomiing unit 
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Service at Cornell University (Ithaca, NY) using Applied Biosystems PRISM 
(Perkin-Elmer) automated sequencers. DNA and amino acid sequences were 
compared with sequences in the GenBank and the EST databases using the 
BLAST program. Genes identical to entries in the GenBank were classified as 
known genes, whereas those that shared sequence identity only to ESTs and 
those which have no identity in either GenBank or EST databases were 
designated as unknown genes. 

RT-PCR. To evaluate the mRNA expression pattern of the cloned cDNA 
in normal and malignant tissues, total RNA was extracted from breast cancer 
cell lines and tumor specimens by the conventional CsCl-guanidine thiocya- 
nate gradient method, and normal tissue RNA was obtained commercially 
(Clontech). Gene-specific oligonucleotide primers were designed to amplify 
cDNA segments of 300-600 bp in length, with the estimated primer melting 
temperature in the range of 65-70°C (see Figs. 2 and 4 for specific primer 
•sequences). All primers were synthesized commercially (Operon Technolo- 
gies, Alameda, CA). RT-PCR was performed using 30 amplification cycles in 
a thermal cycler (Perkin-Elmer) at an annealing temperature of SQ^Cy and the 
products were analyzed by 1.5% gel electrophoresis and ethidium bromide 
visualization. 

Rapid Amplification of cDNA Ends. RACE reactions (5 '-RACE and 
3 '-RACE) were performed using gene-specific and adaptor-specific primers in 
conjunction with Marathon-Ready normal testis cDNA and AmpliTaq Gold 
polymerase (Perkin-Elmer). Products were ligated into the PCR-direct cloning 
vector pGEMT plasmid and analyzed by restriction mapping and sequencing. 

Hybridization Screening of a Testicular Library. A commercially ob- 
tained testis cDNA expression library (Stratagene) was screened using a 
NY-BR-1 PCR product as a probe (sec Fig. 2 for primer sequences), as 
described in the Stratagene manual. Briefly, a total of 5 X 10^ pfu/lSO-mm 
plate were transferred to nitrocellulose membranes (Schleicher & Schuell), the 
membranes were submerged in denaturation solution (1.5 M NaCt and 0.5 M 
NaOH) for 5 min, transferred into neutralization solution (1.5 M NaCl and 0.5 
M Tris-HCl) for 5 min, and then rinsed in 0.2 M Tris-HCl and 2X SSC. The 
membranes were hybridized to a ^^P-labeled DNA probe at high stringency 
condition (68**C, aqueous buffer) and washed at high stringency condition. 
Positive clones were subcloned, purified, and in vivo excised to pBK.-CMV 
plasmid forms as described above. 

RESULTS 

A total of 7 X 10^ pfu from the BR17 cDNA library were screened 
using autologous BR 17 serum at 1:200 dilution. Fourteen reactive 
clones were purified and sequenced. Comparison to GenBank and 
EST database revealed that these 14 clones were derived from seven 
distinct genes, two known and five unknown. These genes, designated 
NY-BR-1 through NY-BR-l, are described in Table 1. Four clones 
were derived from the two known genes, PBK-l (BR17-76, BR17- 
118, and BR17-137) and r/-22 7 (BRl 7- 100). PBK-2 and TI'227 zx^ 
universally expressed genes, because ESTs derived from these two 
genes have been reported in many different normal tissues. Of the 
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Clone 


GenBank 


Expression profile 


NY-BR-1 


BRI 7-128 


Unknown 


Expressed in nornial breast and testis only 








(RT-PCR) 




BR17-]a 








BR] 7-8 








BR17-35b 








BR17-44a 








BR17-44b 






NY-BR-2 


BR] 7-76 


PBK-! 


EST: ubiquitous 




BRI7-118 








BR17-137 






NY-BR-3 


BRI 7- J GO 


TI-227 


EST: muscle, colon* cndothctium. 








pancreas 


NY.BR-4 


BRI 7-9 !b 


Unknown 


EST: retina, cortex, fetal liver, spleen 


NY-BR-S 


BR»7-1!5 


Unknown 


EST: uterus, melanocytes, fetal heart 


NY-BR-6 


BR17-n7 


Unknown 


EST: tonsills, uterus, melanocytes, fetal 








heart 


NY-BR-7 


BR17-144 


Unknown 


EST: uterus, melanocytes, fetal heart 



remaining clones, NY-BR-4 through NY-BR-7, represented by one 
clone each, were also universally expressed based on comparison to 
EST databank entries. The six remaining clones, BR17-la, BRI 7-8, 
BR17-35b. BR17-44a, BR17-44b, and BRI7-128, were derived from 
the same unknown gene, NY-BR-I. Three matching cDNA sequences 
for NY-BR-1 were found in the EST database, two derived from 
breast cancer (accession numbers AI951 118 and AW373574), and the 
third (accession number AWl 70035) derived from a pooled tissue 
source (testis, fetal lung, and B cell), suggesting a possible tissue- 
restricted expression of NY-BR-1 mRNA (see below). 

Structural Analysis of NY-BR-1 cDNA. Compilation of the six 
NY-BR-1 cDNA clones revealed a cDNA sequence of 1464 bp. 
Analysis showed a continuous ORF throughout this sequence, indi- 
cating that this is a partial cDNA sequence, truncated at both 5' and 
3' ends. Comparison with the EST entry AW 170035 (446 bp) re- 
vealed 100% sequence identity in the 89 bp overiapping the 5' 
sequence, with the EST entry extending 357 bp further in its 3' 
sequence than NY-BR-1 cDNA clones. Sequences of the other two 
EST entries (AI951118 and AW373574) are contained within NY- 
BR-1. Combining the EST sequence with the cloned NY-BR-1 se- 
quence allowed the definition of the translational termination codon, 
with a 3' untranslated region of 333 bp. 

To complete the missing 5' cDNA sequence, a testicular library was 
screened using a 7W-BR-1 PCR product as a probe. One of the clones 
isolated during this screening extended the 5' sequence of NY-BR-1 
1346 bp but did not provide a definite translation initiation site. On the 
basis of this cDNA sequence, a 5' RACE-PCR was performed, and 
the PCR product was cloned into the pGEMT plasmid vector and 
sequenced. This 5 '-RACE sequence extended the cDNA sequence 
1292 bp further 5', with the longest ORF starting at the ATG codon 
at position 100. No stop codon was found in the 99-bp 5' sequence, 
suggesting the possibility of additional 5' coding sequence in NY- 
BR-1. However, repeated 5 '-RACE using different nested-primer 
pairs and adaptor-ligated cDNA derived from different NY-BR-1 
mRNA-positive tissues (testis and breast, see below) failed to extend 
the 5' cDNA sequence further. 

The available NY-BR-1 cDNA has a 4125-bp coding sequence and 
a 333-bp 3 '-untranslated segment (submitted to GenBank, accession 
number AF269087). The predicted amino acid sequence from the 
possible ATG initiation codon (nucleotide position 100) is shown in 
Fig. 1. Motif analysis of the amino acid sequence using PROSITE and 
Pfam search programs identified a bipartite nuclear localization signal 
motif at amino acid position 17-34, suggesting that NY-BR-1 is a 
nuclear protein. Five tandem ankyrin repeats were also identified, 
located at amino acid positions 49-81, 82-114, 115-147, 148-180. 
and 1 8 1-2 1 3. The presence of a bZIP site (DNA-binding site followed 
by leucine zipper motif) at amino acid position 1077-1 104 suggests 
that this nuclear protein functions as a transcription factor. Of interest, 
three additional repetitive elements were identified located between 
the ankyrin repeats and the NHj-terminal bZIP DNA-binding site 
(Fig. 1). The first repetitive element, consisting of 357 nucleotides 
(1 19 amino acids), is tandemly repeated three times, spanning amino 
acid residues 459-815. The second repetitive sequence, consisting of 
repeats of 1 1 amino acids, is located between amino acids 224 and 300 
(seven repeats). The third repetitive sequence, consisting of only two 
repeats of 34 amino acids each, is located between amino acids 
301-334 (Fig. 1). 

mRNA Expression of NV-BR-1. NY-BR-1 mRNA expression 
was tested in a panel of 15 different normal tissues (adrenal gland, 
fetal brain, lung, manmary gland, pancreas, placenta, prostate, thy- 
mus, uterus, ovary, brain, kidney, liver, colon, and testis). RT-PCR 
analysis showed a strong signal in mammary gland and testis and a 
very faint signal in placenta. All other tissues were negative (Fig. 2A), 
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Fig. 1. Predicted amino acid sequence of NY-BR-I. A bipartite nuclear loealization 
signal motif is highlighted at amino aeid positions 17-34. Five tandem ankyrin repeats are 
located at positions 49-81, 82-114, 115-147, 148-180, and 181-213. A bZIP (DNA- 
binding site followed by a leucine zipper motif) is located at position 1077-11 CM. The 
peptide segment present only in one of the two splice variants, positions 973-1009, is 
underlined. Three additional repetitive elements were identified (amino acids 459-815, 
224-300, and 301-334; see text). 



NY-BR-1 expression in breast cancers and other tumors was ex- 
amined. Twenty-five breast cancer samples were tested, and 21 of 
them (84%) were positive by RT-PCR (17 showed strong signals, and 
the other 4 samples showed weak to moderate signals; part of the data 
are shown in Fig. 2B). Among 82 nonmammary tumor samples tested 
(36 melanomas, 26 non-small cell lung cancers, 6 colon cancers, 6 
squamous cell carcinomas, 6 transitional cell carcinomas, and 2 
leiomyosarcomas), only 2 melanomas showed NY-BR-1 expression. 

The expression of NY-BR-1 in tissue culture lines was also exam- 



ined in ceil lines derived from breast tumor, melanoma, and small cell 
lung cancer. Four of six breast cancer cell lines (two showed very 
weak signals), four of eight melanoma lines (2 very weak), and 7 of 
14 small cell lung cancer lines (2 very weak) were positive (data not 
shown). 

Chromosomai Localization and Exon-Intron Organization of 
NY-BR-1. Comparison of the NY-BR-1 sequences with the newly 
available working draft version of the human genome allowed the 
assignment of NY-BR-1 to chromosome lOpl 1.21-12.1, with at 
least three chromosome 10 clones showing sequence identity to 
NY-BR-1 (GenBank accession numbers AL157387, AL357148, and 
AC067744). 

A comparison of NY-BR-1 cDNA and genomic sequences also 
permitted the definition of NY-BR-l exon-intron organization. The 
amino acid coding region of this gene contains a basic framework 
of 19 structurally distinct exons, with at least two additional exons 
encoding 3 '-untranslated sequence. The detailed exon-intron junc- 
tion information is described in the GenBank entry (accession 
number AF269087). The six ankyrin repeats are encoded by exons 
2-6. Of great interest was the finding that the 357-nucleotide 
repeating unit in NY-BR-1 cDNA is composed of six exons, exons 
10-15. The available genomic sequences are incomplete in this 
region, and only one of the three copies of the 357-bp repeats in 
NY-BR-1 cDNA was identified. This finding suggests that the 
DNA segment between exons 10 and 15 were duplicated and 
inserted in tandem during evolution. In the isolated NY-BR-1 
cDNA clones, three complete copies and one incomplete copy of 
such repeating units are present. Thus, the predicted exon se- 
quences in NY-BR-1 can be expressed as exons 1-15-(10A-15A)- 
(I0B-15B)-(10C-13C)-16-21, in which A, B, and C are inexact 
copies of the exon 10-15 sequences. The NY-BR-1 cDNA, there- 
fore, is derived from a total of 37 exons; whether there are allelic 
differences in the copy number of this repetitive element (and thus 
the number of exons) in different individuals is currently unknown. 

The available genomic sequence (GenBank AC067744) also al- 
lowed us to extend the 5' sequence of this gene beyond the cloned 
NY-BR-1 cDNA. Translation of the 5' genomic sequence using the 
previously assigned NY-BR-I ORF led to the identification of a new 
translation initiation site 168 bp upstream to the previously predicted 
ATG initiation codon in NY-BR-1 cDNA (see text above and Fig. 1). 



Fig. 2. NY-BR-I mRNA expression by RT-PCR. Primers used were: 
5' primer BR I A. 5'-CAAAGCAGAGCCTCCCGAGAAG-3'; and 3' 
primer BR IB, and 5'-CCTATGCTGCTCTTCGATTCTTCC-3*. Of 15 
nomial tissues, RT-PCR showed strong signals in mammary gland and 
in testis, a very faint signal in placenta, and was negative in other tissues 
{a). Of 19 bnkst cancer specimens shown, 15 were positive, including 
three weak positives (breast tumors 3, 4, and 5; b). 
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Fig. 3. Comparison of the predicted amino acid sequences of NY-BR>1 and NY-BR-I. I. Identical sequences are shown as dots (— ), and gaps arc shown as dashes (- 
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If this newly identified ATG is the true initiation site used in vivo, the 
NY-BR-l polypeptide would contain 1397 amino acids, 56 residues 
longer than is depicted in Fig. 1 (additional NH2 terminus sequence: 
MEEISAAAVKVVPGPERPSPFSQLVYTSNDSYTVHSGDLRKIH- 
KAASRGQVRKLEK). 

Identification of NY-BR-l Splice Variants. Sequence compari- 
son of the six SEREX-defmed NY-BR-l clones revealed that they 
were derived from two different splice variants. One variant contains 
an additional coding sequence of 11 1 bp (nucleotide nos. 3015-3125 
of cloned NY-BR-l cDNA, encoding amino acids 973-1009; see Fig. 
1), which is absent in the other variant Comparison with the genomic 
sequence confirmed that this results from an alternate splicing event, 
with the longer variant incorporating part of the intron 33 into exon 34 
(i.e., exon 17 of the basic exon-intron framework described above). 
Key structural elements predicted above in the NY-BR-l sequence are 
present in both splice variants, suggesting no apparent difference in 
biological function or subcellular localization. 

The expression of these two splice variants was evaluated using 
primers specific to the larger variant, as well as primers spanning the 
alternatively spliced exon. In the normal tissues analyzed, both vari- 
ants showed strong expression in testis and breast by RT-PCR (but not 
in other tissues), with the larger variant being the dominant form in 
testis and the shorter variant dominant in breast. A selective group of 
10 breast cancer samples were typed for these two splice fomis, and 
results showed cotyping of the two variants (7 strong positive, 2 
weaker positive, and 1 negative), with the shorter variant consistently 
being the predominant form. 

Isolation of a NY-BR-l Homologue Gene. Screening testicular 
cDNA library with a NY-BR-l probe identified a cDNA encoding a 
new gene with homology to the NY-BR-I gene. This clone, 3673 bp 
excluding the poly(A) tail, corresponded to nucleotides 1-3481 of the 
NY-BR-l and showed 62% homology. A DNA database search re- 
vealed no sequence identity to GenBank "nr" database, and the new 
gene has been designated NY-BR-l, I (submitted to GenBank, acces- 
sion number AF269088). ORF analysis showed an ORF from nucle- 



otide 641 to the end of the cloned sequence, with 54% homology to 
the putative NY-BR-l protein sequence. The ATG initiation codon of 
NY-BR-l. 1 is preceded by a 640-bp 5 '-untranslated region with 
scattered stop codons. Comparison of the available NY-BR-l and 
NY-BR-l. 1 amino acid sequences is shown in Fig. 3. RT-PCR anal- 
ysis for NY-BR-1.1 showed a tissue-restricted mRNA expression 
pattern distinct from NY-BR-l. Among six normal tissues examined, 
NY-BR-1.1 showed strong RT-PCR signal in testis, moderate signals 
in brain and breast tissues, and was negative in kidney, liver, and 
colon (Fig. AA), Upon multiple repeated experiments, normal breast 
tissues showed either no or weak signals, consistently weaker than 
those observed in testis and often in brain, indicating a lower level of 
expression. NY-BR-1.1 expression was also examined in six breast 
cancer cell lines and 10 breast cancer specimens. One of six breast 
cancer cell lines was positive, in contrast to four of six for NY-BR-l. 
Eight of 10 breast cancer specimens were positive. In comparison with 
NY-BR-l expression, 6 were positive for both, 1 was positive for 
NY-BR-l only, 2 were positive for NY-BR-1.1 only, and 1 was 
negative for both (Fig. 45). The strong expression in brain and 
low-level expression in normal breast and the lack of correlation in 
NY-BR-l and NY-BR-1.1 expression in breast cancer lines and tis- 
sues indicate that these two gene products have a clearly distinct 
expression pattern. 

Genomic Sequence of NY-BR-1.1. Comparison of the NY-BR- 
1 . 1 sequence with the released working draft of the human genome 
revealed one clone with sequence identity (GenBank AL359312). 
This clone was prestmiably derived from chromosome 9, indicating 
that NY-BR-l and NY-BR-l. I reside on two different chromosomes. 
The AL359312 genomic sequence does not contain the entire NY- 
BR-1.1 cDNA sequence, precluding the definition of NY-BR-1.1 
exon-intron structure. However, at least three exons can be defined, 
which are the counterparts of the basic framework of exons 16, 17, 
and 18 in NY-BR-L The exon-intron junctions of NY-BR-l and 
NY-BR-1.1 are conserved in these exons. 
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Fig. 4. NY-BR-l.I mRNA expression by RT-PCR. Primcis 
used wcrc: 5' primer BR-l.lA, 5*-TCTCATAGATGCTGGTGCT- 
GATC-3'; and 3' primer BR MB, and 5'-CCCAGACATT- 
GAATTTTGGCAGAC-3'. Of six normal tissues tested, RT-PCR 
showed a strong signal in testis, moderate signals in brain and 
mammaiy gland, and negative in kidney, liver, and colon (a). Of 1 0 
breast cancer specimens, 8 were positive for NY-BR- 1 . 1 (b). Com- 
paring the NY-BR-I and NY-BR-I. 1 expression, 7 of 10 cotypcd 
(6 positive and 1 negative), whereas two were positive for NY- 
BR-I. I only (tumors 7 and 9), and one was positive for NY-BR-1 
only (tumor 5). 
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DISCUSSION 

The SEREX approach has proved to be a very powerful tool to 
identify tumor antigens (20-30). SEREX-defined antigens have been 
classified into several categories, including differentiation antigens, 
CT antigens, mutational antigens, amplified/overexpressed antigens, 
splice variant antigens, and retroviral antigens (3 1). The expression of 
NY-BR- 1 in breast tissue and testis but not in other normal tissues 
indicates that NY-BR- 1 belongs to the category of differentiation 
antigens. 

Differentiation antigens are antigens that show expression in spe- 
cific cell lineage(s) or at specific stages of differentiation in a partic- 
ular cell lineage(s) (32). This category of antigens has been best 
studied in cells of lymphoid and hematopoietic derivation, starting 
with the definition of mouse cell surface differentiation antigens of 
lymphocytes, such as TL (33, 34), Thy-1 (35), and Lyt-2 (36). 
Application of hybridoma technology to the analysis of human cells 
resulted in the identification of a broad range of differentiation anti- 
gens, and this has led to the classification system for CD antigens 
(37). Most of the initial CD antigens were restricted differentiation 
antigens expressed in lymphocytes and other hematopoietic cells, e.g., 
CDl through CDS primarily restricted to T cells (38). The expression 
of differentiation antigens in normal cells is generally preserved in 
their neoplastic counterparts, and this feature has made these antigens 
useful maiicers in the immunopathological differential diagnosis of 
cancer. The best example of this is again in the hematopoietic/ 
lymphocytic lineages, in which the antigenic profile of the neoplastic 
cells provided the foundation for the classification of leukemia/lym- 
phoma (39). In addition, these antigens can be targets for specific 
immunotherapy, and anti-CD20 antibody, recognizing a B-cell differ- 
entiation antigen, represents the first monoclonal antibody approved 
by the Food and Drug Administration for cancer immunotherapy (40). 

In addition to cells of hematopoietic origin, the melanocyte, a 
specialized cell type in the neuroectodermal lineage, has been found 
to express a number of well-characterized differentiation antigens, 
most of them associated with the melanin-synthesis pathway. Studies 
using polyclonal and monoclonal antibodies initially defmed tyrosin- 
ase (41), gp75 (42), and gplOO (43). Recent efforts to identify mela- 
noma antigens recognized by CD8+ and CD4+ T cells also identified 
these antigens as T-cell targets and further expanded the list of 
melanocyte differentiation antigens (44-48). Melan-A/MART-1, 



identified by transfection-based T-cell epitope cloning as a CD8+ 
T-cell target (48, 49), and Rab38 (50), identified by SEREX analysis 
of melanoma, are two further examples of melanocyte differentiation 
antigens identified through these efforts. Similar to their hematopoi- 
etic coimterparts, the melanocyte differentiation antigens have also 
been found useful in the clinical arena. Antibodies against gplOO and 
Melan-A/MART-1 are widely used to distinguish metastatic melano- 
mas from other metastatic malignancies (51), and melanoma vaccine 
trials targeting these antigens are being actively pursued (see Ref 52 
for an example). 

With regard to common epithelial tissue, a wide range of gene 
products with differential expression have been defined, e.g., cyto- 
keratins (53, 54), mucin-related antigens (55), and hormonal receptors 
(56). However, with rare exceptions, none of these are exclusively 
expressed in only a single epithelial cell type, such as breast epithe- 
lium. In this regard, NY-BR- 1 is of considerable interest because of its 
highly restricted expression pattern in normal tissue, i.e., breast and 
testis. The production of antibody probes for NY-BR- 1 is essential to 
confirm breast specificity at the protein and cell levels. Widi regard to 
cancer vaccine development, the restricted expression of NY-BR- 1 in 
normal breast and breast cancer makes it a highly attractive vaccine 
target However, the presence of a homologous gene, NY-BR-I. 1, that 
is expressed in brain is cause for concern, and it will be necessary to 
show that T-cell reactivity to NY-BR- 1 can be generated without 
cross-recognition of NY-BR- 1 . 1 . 

The predicted protein sequence of NY-BR- 1 contains a DNA- 
binding site and a leucine zipper motif (bZIP). The bZIP motif 
characterizes the superfamily of eucaryotic DNA-binding transcrip- 
tion factors that contain a basic region mediating sequence-specific 
DNA-binding, followed by a leucine zipper required for dimerization 
(57, 58). It is thus most likely that NY-BR- 1 is a transcription factor. 
Five ankyrin repeats are also present in the NY-BR- 1 protein. Ankyrin 
repeat proteins carry out a wide variety of biological activities and are 
found in both cytoplasm and nucleus. The repeat motif has been 
recognized in >400 proteins, including cyciin-dependent kinase in- 
hibitors, transcriptional regulators, cytoskeletal organizers, develop- 
mental regulators, and toxins (59). Thus, the ankyrin repeat in itself is 
not predictive of a specific cellular function or subcellular localiza- 
tion; rather, ankyrin repeats are thought to mediate a wide range of 
protein-protein interactions (59). In comparison to other ankyrin 
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repeat-containing proteins, NY-BR-1 is unique because of the other 
repetitive elements in its predicted protein sequence. These additional 
repetitive elements are not found in other sequences in the public 
protein databases, and their functional significance remains to be 
determined. 

By comparing the cDNA sequence with the recently released work- 
ing draft of the human genome, we were able to derive the following 
important information about NY-BR-J: (a) confirming the cDNA 
sequences of NY-BR-1 and NY-BR-1 .1; (b) mapping to chromosome 
lOpl 1-12 and NY-BR-Ll tentatively to chromosome 9; (c) definition 
of the exon-intron structure of NY-BR-l, a complex gene with 37 
exons, and correlate exon structure to repeating peptide units; and {d) 
conipletion of the NHj temiinus amino acid sequence of NY-BR- 1 
that had defied our multiple cloning efforts and RACE analysis. On 
the other hand, the cDNA sequences of NY-BR-l and NY-BR-L] 
genes from this study will certainly help the annotation of correspond- 
ing genome sequences. The current study thus provides a clear ex- 
ample of valuable data that can be achieved by interaction between the 
human genome project and other scientific fields. 
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■ __ Exhibit C 

FUNCTION 



BestFit makes an optimal^ alignment of the.best segment of simflarity between two sequences. Optima] 
alignments are found bymsertmg gaps to maximize the number of matches using the local homology ' 
algorithm of Smith and Waterman. *v ysj- 



DESCRIPTION 



BestFit inserte pps to obtain the optimal alignment of the best region of similarity between two 
sequences, and then displays the alignment in a format similar to the output from Gap. The sequences 
can be of veiy Afferent lengths and have only a small segment of similarity between them. You could 
take a short ENA sequence, for example, and run it against a whole mitochondrial genome 



SEARCHING FOR SIh/IILARITY 



BestFit is the most powerful method in the Wisconsin Sequence Analysis Package'"' for identifying the 
best region of smailanty between two sequences whose relationship is unknown. 



EXAMPLE 



The sequence gainma.seq contains an Alu femily sequence somewhere in the first 500 bases. alu.seq 
contams a g^enc human Alu family repeat The two sequences are aligned and the best segment of 
similanty is found with BestFit 

% bestfit 

BESTFIT of what sequence 1 ? gamma. seq ' 

Begin (* 1 *) ? 
End (* 11375 *) ? 500 
Reverse (* No *) ? 

to what sequence 2 (* gainma.seq *) ? alu. seq 

Begin (* 1 *) ? 
End {* 207 *) ? 
Reverse (* No *) ? 

What is the gap creation penalty (* 5.00 *) ? 

Whar is the gap extension penalty 0.30 *) ? 

What should Z call uhe paired output display file (* gamma. pair *) 
Aligning 

3 



. Gaps 
Qualify 
Quality ?.ari: 



12?. 3 

'3 4 . 4 ? i 



% 
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Comparison 

OUTPUT 

Here is the output file. Notice how. BestKt finds and displays only the best segments of similarity: 
BESTFir of: gamma. seq check: 6474 from: 1 to: 500 

Human fetal beta globins G and A gamma 

from Shen, Slightom and Smithies, Cell 26; 191-203 

Analyzed by Smithies et al. Cell 26; 345-353. 

to: alu.seq check: 4238 from: 1 to: ■207 

HSREP2 from the EMBL data library 

Human Alu repetitive sequence located near the insulin gene 
Dhruva D R., Shenk T., Subramanian K.N.; "Integration in vivo into 
Simian vxrus 40 DNA of a sequence that resembles a certain faxnSy of 
???^4™Slor "^r''^ ; Proc. Natl. Acad. Sci. V 

C^SecrsMr" ""^^"^ ^"-<^-^:I<=cgcore.Data.Rundata]Swgapdna.Cnp 

Gap Weight: 5-000 Average Match: 1.000 

Length weight : 0.300 Average Mismatch: -0.900 

Quality: 129.3 Length: 209. 

Ratio: 0.625 Qaps: 3 

Percent Similarity: 84.466 Percent Identity: 84.466 , 

gamma. seq x alu.seq June 20, 1994 15: 15 •; . 

137 AGACCAACCTGGCCAACATGGTGaAArCC^TCTCTAC.^WTACAA^ 185 

, 'f"" ii:iiiiiimmiiM_ 1 1 1 1 u 1 1 1 1 iTim 

1 AGACCAGCCTGGCCAACAXGGTGAAACTCCATCTCTACirGaAAATACAAA 50 

186 AATTAGACAGGCATGATGGCAAGrGCCTGiAATCCCAGciACXTGGGAG^ 235 

"""J'llllll II ^lIllllLllllimilllll Mill 

51 AATTASCCAGGOVTd&TC^SiG&rGCCrGeAATCCcLL:^^^^ 100 

236 CTGAGGAAGGAGAATTGCrrGA&CCrGGALsGCAGGAGTTGCAGTGAGCC 285 
nil',, li JIIII_JII_IIII I ill I 1111,1, 

101 CTGACSiaiG.:^GaArCCCTrAAACc£^G?AGG?a(£SGTTGC^^^ 149 
286 GAGAXCATACCACTGCACrCC.\GCCTGGGTGaCAGAACA^GAGrCTGrCT 335 

'"'"r-."--"''IIIIIIIIIIIJMIIIIlK|- 11,111 ■ill 
150 GAGAXCGa^.CGGCTGCACXCCAGCCTLGGTGACAGA^5A(^CTcSTCT^ 198 

336 CAA-aAAAAA 344 

: j • • • . 
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RELATED PROGRAMS 



When you want an alignment that covers the whole length of both sequences, use Gap. When you are 
trying to find only the best segment of similarity between two sequences, use BestFit. PileUp creates a 
multiple sequence alignment of a group of related sequences, aKgning the whole length of all sequences. 
DotPlot displays the entire surface of comparison for a comparison of two sequences GapShow 
displays the pattern of differences between two aligned sequences. PlotSimilarity plots the average 
similarity of two or more aKgned sequences at each position in the alignment. Pretty displays 
alignments of several sequences. lineUp is an editor for editing multiple sequence aUgnments 
CompTable helps generate scoring.matrices for peptide comparison. 



ALGORITHM 



BestPit uses the'.focoZ AomoZogy algorithm of Smith and Waterman (Advances in AppKed 
Mathematics 2; 482-^ (1981)) to find the best segment of similarity between two sequences. BestFit 
reads a scoring matrix that contains values for eveiy possible GCG symbol match (see the LOCAL 
DATA FILES topic below). The program uses these values to construct a path matrix that represents 
the entire surface of comparison with a score at every position for the best possible alignment to that 
point The quality score for the best alignment to any point is equal to the sum of the scoring matrix 
values of the matches in that alignment, less the gap creation penalty times the number of gaps in that 
alignment, less the gap extension penalty times the total lengtii of all gaps in that alignment The gap 
creation and gap extension penalties are set by you. If the best path to any point has a negative value, 
a zero is put in that position. 



After the path matrix is complete, the hig^iest value on the surface of comparison represents the end of 
the best region of similarity between the sequences. The best path firom this highest value backwards 
to the point where the values revert to zero is the alignment shown by BestPit This alignment is the 
best segment of similarity between the two sequences. 

For nucleic adds, the default scoring matrix has a jnatch value of LO for each identical symbol 
comparison and -0.90 for eadi non-identical comparison (not considering nucleotide ambiguity symbols 
for this example). The quality score for a nucleic add alignment can, therefore, be determined using 
the following equation: 

Quality = 1.0 x TotalMazclies + -0.90 x TotalMismatches 
- (GapCreationPenalty x GapNuinber) 

- (GapExtensionPenalty x Tot alLengtbOf Gaps) 

The quality score for a protein alignment is calculated in a similar manner. However, while the default 
nuddc add scoring matrix has a single vailue for all non-identical comparisons, the default protein 
scoring matrix has different values for the varioTis non-identical amino add comparisons. The quality 
score for a protein alignment can therefore be determined using the following equation (where Total 
is the total number of A-A (Ala-Ala) matches in the alignment, Cn^Val is the value for an A-A 
comparison in the scoring matrix, Toral^ is, the total number of A-B lAla-Asx) matches in the 
alignment, CmpVal^ is the value for an A-B comparison in the scoring matrix, ...) : 

Qualiry = CmpVal^ x Total 
CmpVal^ X Tocal^ 

- CzipVal^ X Total" 

AC jc 



^. " 

- ^^apCreazi.on?enalty x GapNuinber) 

- (3ap£xter.sio.^^enalry x TotalLengthOf Gaps) 



For a more complete discussion of scoring matrices, see the Data Files manual. 



5-22 BesfFit 
CONSIDERATIONS 



Comparison 



BerfRt Always Rnds Something 

BestFit always finds an alignment for any two sequences you compare - even if there is no 
sxgmficant similarity between feemJ You must evaluate the results critically to decid?i?the 
segment shown is not just a random region of relative similarity. 

The Segments Shovim Obscure Altemolive Segments 

BestPit only shows one segment of similarityrso if there are several, all but one is obscured You 
can approach this problem with graphic matrix analysis (see the Compare and DotPlot 
programs). Alternatively you can run BestFit on ranges outside the ranges of similarity fomS 
m earher runs to bring other segments out of the shadow of the best segmit 

The Best Fit b Only One Member of a Family 

i^™!?' algorithms, the ^ment displayed is a member of the family of best 

ahgnments. This family may have other members of equal quaKty, but wiU not have amv 
member wife a higher quality. The femily is usually ^ga^^^r^frm^^ZZ 
of gap creation and gap ^sion penalties. See the CONSIDERATIONS topic in the entnr for 

fa^SS^nStit' " *° *° -d 

The Surface of Comparison 

J^^f'^^"^!/*^^ T^^'S^^ ^? ^ proportional to the area of the surface of comparison. 
That area K deteimmed by the product of the lengths of the two sequences compared. BestPit 
can evaluate a surface of up to 3.5 milHon elements. This suifece would be 1^ enoljh to 
compare two sequences appimimatdy 1,870-symbols long, or one sequence 200-^ok long 
Sir. r "^"^^"^ 17,500-symbols long. When you have much longer sequen^timt Sf 
^d^ntiy command-hne option -lOMit to use the surface more 

The Public Scoring Matrix for Nucleic Acid Comparisons is Very Stringent 

T^^r^J^ ™^ swgapdna.cmp penalizes mismatches -0.9 so the segments found may be very 
^ ^"t^Si^^ "^^^ ^^^^ be extended ^three b^^^pki^ 

extra match. The s^rmg matrix used by Smith and Waterman, when local aHgmnents were first 
descnbed^ used -0.333 for the mismatch penalty. You can use Fetch to copy r^mdna.mpS 
p^ at ^'^^""P ^ ^^^^ ^ nwsgapdna.cmp, which has no mis4toh 

Rapid Alignment 

When possible, BestFit tries to find the optimal alignment very qmckly. If this rapid alignment 
is not unambiguously optimal. BestPit automatically realigns the sequences to calSate the 
optimal alignment When tiiis occurs, the monitor of alignment progress on your terminal screen 
U'--^igr.:Lr.g-. . . ; is displayed twice for a single alignment 

ALIGNING LONG SEQUENCES 

This program car. align ver.- long sequences if you know roughly where the alignment of interest 
be^ns. Run tiie program wnth the command hne option -liMir. Then set the starting coordinates for 
set;.tTX T:^"'"^? ^ aHgnment of interest begins and set gap shift limits on each 

. sequence. I^e program dien ahgns the sequences from your starting point such that the sequences do 
not get out of phase by more than the gap shift limits you have set. If you started both sequences at 
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■base nujiber one and set the gap shm Kmit for sequence one to" 100 and for sequence two to 50 fK.. 

ce ;:o~^ ^^^^'^ ^ -^^^^ rrto'4k?o^ 



whether ^.ealign^e^t 00^^^^^^^^ 

gaps m each sequence. If .the program, cannot rule out this possibility, it dispkysTe me^~ 
*** Alignment zs not guaranteed ..to-be optimal***. Because the criteria use? 
ol^i :v.n"«'^'"*''"^ an opfcin^^aUgnment are've^ stringent, a Kxniid a^^^t o^n xnayt 
optimal even rfthis message is displayed. In any event, the program continues to (^leti^ ^ 

EVALUATING AUGNMENT SIGNIRCANCE 

^hS'^S T the significance of the aligmnent. using a simple statistical 

• '^v^'^i^"^ ^« «P«°«- second sequence is r^eateS 

length and composition, and then reaKgned to the first se^ienl The a^™ 
ahgom^t score, phis or mmus the standard deviation, of aU randomized aKgmnenS is Tepor^ iTtS 
^1 7Z T^^' ttos average quality score to the quality score rfflie acSi S^^t t^ 
hdp equate the agmficance of the alignment The number of randomizations can iSlSd aJon^ 
with the -RMidomizations command line qualifier; the default is 10. ^ ^^^S 

^e score of each randomized alignment is reported to the screen. You can use <ctn>c to interrunt 

Jtirt^^^e^^e^MS Z^%T^' rrsS ^Tanl 

n^aSS.^'""^^- ^^^^--^Cl9S4))^radiscus^or^S;Si/^^^ 

ALIGNMENT METRICS 

BestFit and Gap display four figures of merit for alignments: Quafily, Katio. Identity, and Similarity. 

The Qu^ty (d^cribed above) is the metric maximized in order to align the sequences Ratio is the 
^ divided ^ the number of bases in ihe shorter segment PercenUdentityT^e pe^^ ^f Sei 

f W ^^>S ^^J^^ are ignored. A similarity is scored when the scoring matrix value for 

a p^r rf s^Tobols is greater than or equal to 0.50, the simOarity threshold. This tSold is 

.^Wtiie™^^^^^^ For instance, the expression 

^^J^^i^"^::^ <^^-t ,ro^o^ so they should not 6e used 

PEPTIDE SEQUENCES 

I'll r7'rn3rT "^f^^ ^^^^ences, this program uses a scoring matrix with matches scored 
h iTt ^ f '° evolutionary distance between the amino adds as 

eTiTes ?986 ^ "o^^alized by Gribskov VGribskov and Burgess Nucl. Acids. Res. 14(16), 
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Input seqaences may not be more than SO.OOO-syinbols long. This program cannot evaluate a ^,rf,.« 
companson larger than 5.5 miffion elements. A 200x27,500 cLSrisonTposSk L 
2.300x2,300 comparison. See the ALIGNING LONG SEQUENOEsTpk for help ^ S^^l f ^ 
sec^ences that would normaDy exceed the maximum surface of compSn. Yofc^Sf^f ^t^? 
meSo^!^'^'" "^^'^ comparison if your ^stem has emfv^ 

SEQUENCE TYPE • 

The fimction of BestFit depends on whe^^^ Normally 
Retype of a sequence IS deter^^^ N<.r Type: P on the lastlSe of 

tn!^- w'^'^T^ ^t^T'^f If your sequence(s) are not the correct type, t,^ to 

Appendix VI for information on how to change or set the iype of a sequence. , xniin to 

COMMAND-UNE SUMMARY 

All parameters for this program may be put on the command Kne. Use the option -CHEck to see the 
^i^naxy below and to W a dhance to add things to the command line before the program 
In the summary below, the capitahzed letters in tiie qualifier names are the letters iLt you m«S?S 
m order to use the panuneter. Square brackets ([ and D enclose qualifiers or pai^eter XS^St^e 
& ^"thTu^^^^r • ^ Chapter^asic Concepts: u£l 

Minimal Syntax: %,bestfit [-INfilel-Jgaitima.seq [-INfile2-] alu.seq -Default. 

Proinpted Parameters: 



-BEGial^l -BEGin2=l 
-END1=500 "SND2=207 
-NOREVl -N0REV2 
-GAPweight=5 . 0 
-LENgrthweighr=0 . 3 
[-OUTfilel=] gamma, pair 



beginning of each sequence 
end of each sequence 
strand of each sequence 

gap creation penalty (3.0 is protein defaiilt) 
gap extension penalty (0.1 is proteiii default) 
output file for alignment 



Local Data Files: -DATa=swgapdna . crtro 

-DATa=swgappep , cnqp 



scoring matrix for nucleic acids 
scoring matrix for peptides 



Optional Parameters: 

"OUTf ile2«gamma . gap 
-0UTfile3«alu,gap ' 
-LIMitl«499 -LIMiz2=206 
-RANdomizations [=10] 

-PAIr=1.0, 0.5,0.1 

-WIDth=50 

-?AGe=60 

-N03I3Gaps 

-HIGhr:>ad 

-LOWroad 

-MCSUMmar-/ 



new sequence file for sequence 1 with gaps added 

2 " " 

limit the surface of conparison 
determine average, score 'from 10 randomized 
alignments 

thresholds for displaying ' | ' , ' : ' , and ( . ' 
the number of sequence symbols per line 
adds a line with a form feed every 60 lines 
suppresses abbreviation of large gaps with '.'s 
niakes uhe zor> alignment for your parameters 
makes the bottom alignment for your parameters 
suppresses rhe screen summarv 
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LOCAL DATA FILES 

ffles described below supply auxiliary data to this program. The program automaticaDy reads 
them from a pubbc data directory unless you either 1) have a data file with exactly the same name in 
^ifn.'?!I!!L-r'"?^ irectoiy. or 2) name a file on the command line with an expression like 
-l)M!al=nrsrf lie. dat. For more information see Chapter 4, Using Data PUes in the User's Guide. 

ISL^^^TSS^" S«T S ^S'^'^T,^'^^ ^'^'^S matrix in the pubKc file 

swgapdna.cmp. (SW stands for Smith and Waterman.) If the first sequence you name is a peptide 
sequence, BestFit reads swgappep.cmp instead. The presence of these files in your current woAinff 
directoiy cau^s BestFit to read your version instead. (See the Data Files manual for more 
inrormation about scoring matrices.) 

OPnONAL PARAMETERS 

The parameteis and switches listed below can be set from the command line. For more information, 
see "Usmg Program Parameters in Chapter 3, Basic Concepts: Using Programs in the User's Guide. 

-LIMitl=20 and -LlMit2=20 

let you set gap shift limits for each sequence. When you afready know of a long similarity 
betwewi two sequences you can "zip" them together using this mode. The beginning coordinates 
for each sequaice must be near the beginning of the aHgnment you want to see. The aKgnment 
continues so that gaps mserted do not require the sequences to get out of step by more than the 
gap shift Imuts. You c^align veiy long sequences rapidly. The surface of comparison is stSU' 
limited to 3.0 miDion. The size of a comparison can be predicted by multiplying the average 
length of the two sequences by the sum of the two shift limits. 

If you add to the command line without any qualifier value, the program prompts you to 

enter gap shift hmits for each sequence. 

-RaNdomi2arions=10 

reports the average alignment score'imd standard deviation from 10 randomized ahgnments in 
which tiie second sequence is repeatedly shuffled, maintaining the length and composition of the 
ongmal sequence, and then aHgned to the first sequence. You can use the optional parameter to 
set the number of randomized alignment to some number other than 10. 

-0TCfi:e2=seqnaaael.gap -ODTfils3=seqnaine2.gap 

This orofram can write three different output files. The first disT)lavs the aKgnment of sequence 
one *-tn sequence two. The second is a new sequence file for sequence one, possibly expanded bv 
gaps to make it ah^ with sequence two. The third, like the second, is a new sequence file for 
sequence two, possibly expanded by gaps to make it align with sequence one. The program 
writes only xhe first file unless there are output file options on the command line. If there are 
any output files named on the command line, only those output files are written. If you add 
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-OUT to the cotomand Ime wi&out any quali^ing filename,- thai the pro^ will imte -thp 
second and tiurd output files after prompting you for their names. 

Aligned sequences (in sequence files) can be displayed with GapShow. Iheir similarity can be 
displayed with PlotSimilanty. ' 

-PAIr=1.0,0.5,0.1 

The paired output file, from this program displays sequence similarity hy printing one of three 
characters between simJar sequence symbols: a pipe chaiacter( I ), a colon (:), or a period ( ) 
Normally a pipe character is put between symbols that are the same, a colon is put between 
syn^ls whose comparison value is greater than or equal to 0.50, and a period is put between 
gmibols whose comparison value is greater than or equal to 0.10. You can change these Tnatch 
fif v^.r?^.? command line. The tiiree parameters for -Pair are the display 

thresholds for the pipe ch^cter, colon, and period. The match display criterion for a pipe 
charac^dianges from qrmbohc identity (the default) to the quantitative threshold you have set 
m the &st parameter. A ppe character win no longer be inserted between identical symbols 
xmless their comparison values are greater than or equal to this threshold. If you still want a 
pipe dbaracter to connect idwitical symbols, use x instead of a number as the first parameter 
(See the Data Files manual for more information about scoring matrices.) 

-PJU3e=64 

Whaa yau print the output from this program, it may cross from one page to another in a 
frostratog way - espeaally when you print on individual sheets. This option adds form feeds to 
the output file m order to tiy to keep clusters of related information together. You can set the 
number of hnes per page by supplying a number after the -PAGe qualifier. 

-KlDth=50 

puts 50 sequence symbols on each line of the output file. You can set the width to anythingfrom 
10 to 150 symbols, 

-NOBIGGaps 

suppresses large gap abbreviatdons, showing aD the sequence characters across from large gaps 
UsuaUy, gaps ttat extend one sequence by more than one complete line of output are abbreviated 
with three dots arranged in a vertical line. 

-LOWroadand -HIGhroad 

The insertion of gaps is, in many cases, aititraiy, and equally optimal aHgnments can be 
generated by mserting gaps differently. When equally optimal alignments are possible/this 
program c^ msert the gaps differently if you select either the -LOffroad or the -HiGhroad 
options. Here are examples for the ahgriment of GACCAT with GACAT with different 
parameters. 

For: Match = 1.0 MisMatch = -0.9 

Gap weighr = 1.0 Lengrh Weight = 0.0 



lrw?,:: = i: 1 3ACCilT 6 
1 SA.CAT 5 

Hxcrr-Rcad: 1 GACCAT 6 

Ml 1! 



Qualizv =4.0 



Quaiitv =4.0 



.eomparison 
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For:- Match = 1.0 

Gap weight =3.0 



HighRoad: 



LowRoad: 



1 GACCAT 6 
I I I 

1 GACAX. 5 

1 GACCAT 6 

• .III 
1 -GACAT 5 



MisMatch 
Length Weight 



Quality =3.0 



Quality = 3.0 



0.0 
0,0 



Ess^y the too; road shifts aH of the arbitrary gaps in sequence two to the left and aU of the 
arbtraiy gaps m sequence one to the right. The high road ^o^^c^j the oppos^tT Xfne^er 



-SXJKfenary 



writes a sammaxy of the program's work to the screen ^en you've used the -Default tmaMi»r fn 
^f^Zf interaction. A suminaiy typi^ displays at theend of f prSS 

interactrvely. You can suppress the sununaiy for a program rnnltiactively with -NO^^ 

U^s qualifier also to include a smmnaiy of the program's work in tiie log file for a program run in 
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